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We present accurate metallicity measurements for 121 clamped Lya systems at 0.5 < z < 5 including 
ss 50 new measurements from our recently published Echcllettc Spectrograph and Imager surveys. This 
dataset is analysed to determine the age-metallicity relation of neutral gas in the universe. Contrary 
to previous datasets this sample shows statistically significant evolution in the mean metallicity. The 
best linear fit rate to metallicity vs. redshift is —0.26 ± 0.06 dex corresponding to approximately a 
factor of 2 every Gyr at z = 3. The DLA continue to maintain a floor in metallicity of sa 1/700 solar 
independent of observational effects. This metallicity threshold limits the prevalence of primordial gas 
in high redshift galaxies and stresses the correspondence between damped systems and star formation 
(i.e. galaxy formation). This floor is significantly offset from the metallicity of the Lya forest and 
therefore we consider it to be more related to active star formation within these galaxies than scenarios 
of enrichment in the very early universe. Finally, we comment on an apparent 'missing metals problem': 
the mean metallicity of the damped systems is w 10 x lower than the value expected from their observed 
star formation history. This problem is evident in current theoretical treatments of chemical evolution 
and galaxy formation; it may indicate a serious flaw in our understanding of the interplay between star 
formation and metal production. 



Subject headings: galaxies: abundances 

1. INTRODUCTION 

For the past decade researchers have observed the damped 
Lya systems (DLA) - quasar absorption line systems with 
HI column density iV(HI) > 2 x 10 20 cm -2 - to trace cos- 
mological properties of neutral gas in the early universe. 
With moderate resolution spectroscopy, for example, ob- 
servers have taken a census of the HI mass density ^l gas 
from z — to 5 and found damped Lya systems to com- 
prise most of the neutral gas out to at least z = 4 (Wolfe 
et al. 1986; Lanzetta et al. 1995; Wolfe et al. 1995; 
Storrie-Lombardi and Wolfe 2000; Rao & Turnshek 2000; 
Peroux et al. 2001; Djorgovski et al. 2003). By combin- 
ing these observations with higher resolution spectroscopy 
of metal-line transitions, one tracks the metal enrichment 
of the universe in neutral gas (Pettini et al. 1994, 1999; 
Prochaska & Wolfe 2000). If the individual DLA metal- 
licity measurements are weighted by their corresponding 
H I column densities, the resulting mean represents a cos- 
mological quantity: 0, meta i s /ft gas , which equals the mass- 
weighted metallicity < Z > of neutral gas in the universe 
(Lanzetta et al. 1995). Aside from selection biases, this 
statistic is independent of any physical property (e.g. mass, 
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galaxies: chemical evolution — quasars : absorption lines 

morphology) of the damped systems surveyed and, there- 
fore, it presents a fundamental test for theories of chemical 
evolution (e.g. Pei & Fall 1995; Somerville, Primack, & 
Faber 2001). In turn, these observations constrain the 
star formation history of the universe and help describe 
the interplay between nucleosynthesis and gas enrichment 
in high redshift galaxies. 

Because < Z > is a N (HI)- weighted measure, the uncer- 
tainty in this statistic is dominated by the damped systems 
with the highest product of metallicity and HI column 
density. This is analogous to measurements of Q gas where 
damped systems with the largest iV(HI) dominate the un- 
certainty 8 . For this reason, previous metallicity samples 
were susceptible to severe sample variance, systematic er- 
ror, and potential outliers. With the goal of reducing the 
effect of small number statistics, we initiated a program 
(Prochaska, Gawiser, & Wolfe 2001) with the Echellette 
Spectrograph and Imager (ESI; Sheinis et al. 2002) to 
rapidly increase the sample of high z damped systems with 
accurate metallicity measurements. In comparison with 
previous echelle observations, this instrument and observ- 
ing strategy have led to a nearly lOx increase in efficiency. 
In roughly 5 nights observing time, we have doubled the 
number of z > 1.5 damped systems and nearly quadrupled 
the systems at z > 3 (Prochaska et al. 2003a,b). In this 
Letter, we report the principal results on chemical evolu- 
tion from our ESI surveys of the damped Lya systems. 
Combining these new measurements with w 50 damped 
systems drawn from the literature, our analysis includes a 
sample of over 100 damped Lya systems from z = 0.5 to 
5. 



8 This effect is less severe at z > 4 where there are fewer DLA with 
W(HI) > 10 21 cm" 2 (see Peroux ct al. 2001) 
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2. THE SAMPLE 

At present over 300 damped Lya systems have been 
identified in the literature (Curran et al. 2002). Of these, 
approximately half provide a metallicity estimate. Unfor- 
tunately, these metallicity measurements are derived from 
a variety of telescopes with a range of instrumentation and 
therefore form a heterogeneous sample of data (e.g. varying 
resolution, wavelength coverage, and S/N). In this Letter, 
we primarily restrict 9 our analysis to damped Lya sys- 
tems observed on the current generation of large telescopes 
(e.g. Keck and the VLT) with high-resolution (R > 5000), 
high S/N (> 15 per pixel) spectra. Although this is a 
subjective observational criterion, there are many cases 
of understated statistical error and under-appreciated sys- 
tematic error in the literature for data of poorer quality. 
We also limit the analysis to systems satisfying the strict 
AT(HI) > 2 x 10 20 cm~ 2 criterion. This practice facilitates 
comparisons with statistical surveys of the damped sys- 
tems and simplifies comparisons with theoretical models. 

Table 1 lists the name, z a b s , iV(HI), metallicity [M/H], 
Fe abundance [Fe/H], and reference for the 121 DLA com- 
prising the complete sample. In columns 4 and 6 we list 
two 'flags' which describe the derivation of the [M/H] and 
[Fc/H] values. The latter are determined primarily from 
Fell transitions or when necessary CrII, Nill, or A1II 
transitions offset by their typical [X/Fe] value (e.g. Prochaska 
& Wolfe; hereafter, PW02). When possible, we adopt an 
[M/H] value based on an observed a-element (e.g. Si, S) 
or Zn. These elements are mildly or non-refractory and 
should exhibit minimal depiction in the damped Lya sys- 
tems. Furthermore, these elements exhibit solar relative 
abundances with few exceptions (PW02). In cases where 
there are only limits reported to these abundances and the 
limits span an interval less than 0.4 dex (e.g. ±0.2 dex), we 
adopt the central value and an error encompassing the two 
limits. As a last resort (8 cases), we adopt an [M/H] value 

9 The exceptions are the z < 1.5 damped systems which include 
an estimate from X-Ray observations (Junkkarinen et al. 2003, in 
preparation) . 



calculated from [Fe/H] assuming an offset of 0.4 dex. We 
consider this robust because the median and mean [a/Fe] 
and [Zn/Fe] values for DLA at z > 2 arc all close to 0.4 dex 
(PW02). 

We present the full set of [M/H] values as a function of 
z a b s in Figure 1. The dark, unbinncd points identify the 
data drawn from our recent ESI surveys while the light 
unbinncd points are drawn from our echelle measurements 
and those of others reported in the literature. The error 
bars represent the statistical uncertainty in these measure- 
ments. For the majority of these observations, this error is 
dominated by the uncertainty in iV(HI) measurements, an 
uncertainty that is typically not rigorously derived. The 
quoted values tend to overestimate the statistical error and 
underestimate the systematic error from linc-blcnding and 
continuum placement. In the following analysis, we adopt 
a minimum error of 0.1 dex for all metallicity measure- 
ments. It is important to emphasize, however, that none 
of the analysis in this Letter is sensitive to statistical error 
related to individual measurements. 

3. ANALYSIS 

In Figure 1 we overplot the unweighted, logarithmic 
mean metallicity (i.e. the mean of [M/H] values) in 5 red- 
shift bins at z > 1.5 defined to have equal numbers of 
DLA and one redshift bin with z = [0.5, 1.5] covering the 
damped systems with Lya profiles at A Q fc s < 3100A. This 
statistic was chosen to best represent the evolution in the 
metallicity of a 'typical' damped system as seen by eye in 
the log- linear plot in Figure 1. The vertical error bars re- 
port 95% c.l. uncertainty in this mean value as determined 
from a bootstrap error analysis. Meanwhile, the redshift 
marked for each bin refers to the median z aos value. Even 
an eyeball analysis of the figure reveals a statistically sig- 
nificant evolution in the unweighted mean metallicity. Per- 
forming a least-squares linear fit to the binned data (which 
best accounts for the scatter in the measurements) , we cal- 
culate a slope m = —0.26 ± 0.06 dex/Az and zero-point 
[M/H] = -0.67 ± 0.17 dex. The results indicate that the 
metallicity of the 'average' galaxy is increasing with de- 
creasing redshift with an e-folding time of approximately 
1 unit redshift, i.e., a 2x decrease per Gyr at z <~ 3. This 
statistically significant evolution in the unweighted mean 
contrasts with our previous samples, although Vladilo et 
al. (2000) suggested such a trend at z < 3 in a much 
smaller sample. We note that the y-intercept of the best- 
fit line is nearly 4c below solar metallicity. This may be 
an indication of small sampling at low z or our assumption 
that the logarithmic mean metallicity evolves linearly with 
redshift. On the other hand, it is possible that a cross- 
section selected sample of HI gas at z = (e.g. Rosenberg 
& Schneider 2003) would show a sub-solar metallicity due 
to the contributions from dwarf and low surface-brightness 
galaxies. 

The cosmological mean metallicity < Z > was com- 
puted using < Z >= logE 4 10l M / ff ] W(Hiy £ 2 A/(HI) J 
and is presented in Figure 2 in the same six bins. In the 
figure, the individual measurements are now represented 
by squares whose areas scale with the TV (HI) values of the 
DLA. Because the < Z > statistic is dominated by the 
DLA with the largest iV(HI) and [M/H] values, its mea- 
surement uncertainty will always be dominated by sample 
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Fig. 1. — The [M/H], z abs values for the 121 damped systems comprising our complete sample. The darker points indicate the new values 
from our ESI surveys while the lighter data points are all taken from the literature (primarily HIRES and UVES observations). The symbols 
for the points indicate the origin of the [M/H] values: a-element measurement (filled squares); Zn measurement (filled circle); ct+Zn limits 
(filled triangle); Fe measurement + 0.4 dex (filled star); Fc limits + 0.4 dex (open circle). A trend of lower metallicity at higher redshift is 
clearly evident in the figure. We evaluate this trend by measuring the unweighted mean metallicity in the 6 bins (solid star points with error 
bars). A least-squares fit to these data points yields a best fit slope of m = —0.26 ± 0.06 dex. The error bars plotted on the binned data 
represent 95% c.l. derived from a bootstrap error analysis. 
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Fig. 2. — The [M/H], z a t, s pairs are plotted as open squares where the area of each square is scaled to the 7V(HI) value of the damped 
system. This aids the eye in determining which sightlincs dominate the iV(HI)-weighted, cosmic mean metallicity. The cosmic metallicity 
< Z > is plotted for 6 bins with 95%c.l. uncertainties given by a bootstrap analysis. Similar to the unweighted mean, we find a best fit slope 
of m = —0.25 ± 0.07 dex assuming a linear solution to the < Z > vs. z ab3 values. 
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variance as opposed to statistical error. We estimate this 
variance through the bootstrap technique and the vertical 
error bars refer to 95% c.l. on the mean. The bootstrap 
technique assesses the sample variance under the assump- 
tion that the observed distribution is not severely different 
from the true, underlying distribution. If the data sample 
is too small, even a single outlier could significantly change 
the central value of the mean statistic significantly beyond 
the error implied by a bootstrap analysis. While this issue 
was a concern in previous analyses, the results at z « 2 — 4 
are now robust to all but the unlikeliest of outliers. For ex- 
ample, the discovery of a system with TV (HI) = 10 22 cm~ 2 
and 1/3 solar metallicity would only increase < Z > by 
« +0.2 dex in the z ~ 2 bin or +0.25 dex at z ~ 3.5 
(see also Prochaska 2003b). Such an outlier would lie 
nearly a factor of 10 off the current AT(HI), [M/H] dis- 
tribution and it would necessitate an entire population of 
DLA separate from the 100 sightlines presented here. This 
population could only exist if the current sample is signif- 
icantly biased by selection effects (e.g. dust obscuration), 
an assertion unsupported by any recent observational test 
(e.g. Ellison et al. 2001; PW02). 

Performing a least-squares fit to the < Z > values, we 
find m = -0.25 + 0.07 and b = -0.61 + 0.21 dex. Note that 
the inclusion of the z < 1.5 bin has minimal impact on this 
analysis. Somewhat to our surprise, especially given the 
impression from a visual inspection of Figures 1 and 2, the 
best fit slope to the < Z > values is identical within statis- 
tical uncertainties to the value derived for the unweighted 
mean. This marks the first demonstration of evolution in 
< Z > at greater than 3cr confidence. In this case, our 
analysis implies that the mean metallicity of the universe 
in neutral gas is approximately doubling every billion years 
from z = 4 to 2. 

4. DISCUSSION 

The history of metallicity in the universe provides a po- 
tentially powerful constraint on models of galaxy forma- 
tion, and it is tempting to draw additional conclusions 
from the distribution of points in the figures. The scat- 
ter amongst the points in each bin does not appear to 
evolve with redshift, implying that at each epoch DLA 
represent a set of systems at various stages in their for- 
mation. The interpretation of this is complicated by the 
unknown breakdown of the observed scatter into scatter 
amongst galaxies and scatter within galaxies. Sightlines 
through the DLA provide pinpricks of transverse diame- 
ter < 10 pc through the absorbing system. Although DLA 
abundances show remarkable uniformity amongst multiple 
components along these sightlines (Prochaska 2003a), we 
do not know how well the metallicity of the entire galaxy 
is sampled by these measurements. These characteristics 
of an evolving mean with nearly constant scatter would 
like pose a difficult challenge to scenarios which would de- 
scribe damped Lya systems as a transient phase in the 
formation of galaxies. 

Another important characteristic of the observed distri- 
bution of [M/H] ,z pairs is the areas of rcdshift-mctallicity 
space that remain unoccupied. In particular, note the ab- 
sence of any DLA at [M/H] < —3 at all redshifts and the 
lack of DLA at [M/H]> at all redshifts. The eye can 
identify a gradual increase in the maximum and minimum 



metallicity with redshift at roughly the same rate of evo- 
lution as that of the unweighted logarithmic mean, but it 
would be dangerous to interpret this since the max and 
min are sensitive to outliers. Regarding the lower limit 
to the DLA mctallicities, it appears possible that we will 
never identify a damped Lya system with [M/H] < —3, a 
value which significantly exceeds our detection limit. This 
lower bound has important implications for the presence 
of primordial gas (zero metallicity) within these galax- 
ies. If primordial gas with significant surface density and 
cross-section exists in high redshift galaxies, then it is al- 
ways surrounded by metal-enriched gas yielding a mass- 
weighted metallicity exceeding 1/1000 solar. Alternatively, 
primordial gas may not exist in the neutral phase within 
high z galaxies. It may be possible to distinguish be- 
tween these scenarios by observing the higher order Ly- 
man lines to pursue metallicity measurements of individual 
DLA components. 

This metallicity floor also limits the contribution to DLA 
from gas 'clouds' which are unrelated to galaxies (e.g. 
overdense regions associated with large-scale structures). 
The metallicity floor requires that this gas was enriched 
to [M/H] > —3, presumably by a nearby galaxy. To 
match the observed metallicities would probably require 
fine-tuning, which argues against the likelihood that DLA 
are anything but galaxies in the early universe. We also 
emphasize that the lower bound to the metallicity of the 
DLA ([M/H] w —2.6) is significantly higher than the metal- 
licities typically attributed to the Lya forest (e.g. Songaila 
2001). This suggests that the lower bound to the DLA 
values may not be simply related to the physical processes 
which have enriched the Lya forest (e.g. pollution from 
Population III stars) . We contend that the offset between 
the DLA mctallicities and the Lya forest indicates the 
enrichment of DLA gas at all metallicity is dominated 
by metal production within these galaxies. It is possible, 
however, that Pop III scenarios would predict larger pre- 
enrichment in regions of higher overdensity, perhaps even 
to the lower bound observed for the DLA. 

A key implication of our results comes to light in their 
comparison with the recent determination of DLA star for- 
mation rates (Wolfe, Prochaska, & Gawiser 2003). Al- 
though it is not yet possible to compare the metallicity 
and integrated star formation rates of individual DLA, it is 
possible to compare the average DLA metallicity at z — 2.5 
with the integral under the cosmic star formation history 
at z > 2.5 (Wolfe, Gawiser, & Prochaska 2003). The 
comparison illustrates a "missing metals problem" for the 
DLA: the integrated SFR's measured for the DLA com- 
bined with a standard IMF and yield of heavy elements 
imply 10 x the mass density of metals observed in the 
damped systems. This conflict suggests scenarios where 
the metals are ejected from the galaxy via supernovae feed- 
back or where the metals are sequestered within the star 
formation regions for significant timescales (e.g. within a 
galactic bulge; Wolfe, Gawiser, & Prochaska 2003). It is 
also possible that the mean metallicities of the typical DLA 
systems are not representative of those in the actual star 
forming regions. Observations of absorption systems asso- 
ciated with GRBs, which probably do probe the star form- 
ing regions themselves, suggest higher metallicities which 
may be more typical of disk stellar populations (see, e.g., 
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Mirabal et al. 2002, Savaglio et al. 2003, or Djorgovski 
et al. 2003, and references therein). A fundamental un- 
certainty in comparing the actively star forming regions 
sampled by GRBs and that of the majority of the neutral 
gas in the universe sampled by the DLA is our lack of un- 
derstanding of the mixing efficiency of metal enrichment. 
More detailed modeling is needed before this issue can be 
completely resolved. 

This problem is evident in recent theoretical models of 
chemical evolution which consider the mean metallicity of 
the universe in neutral gas (Pei, Fall, & Hauser 1999; 
Somerville, Primack, & Faber 2001; Tissera et al. 2001; 
Mathlin et al. 2001; Cen et al. 2003). These treatments 
range from analytic chemical evolution scenarios (i.e. inde- 
pendent of galaxy formation models) to full-blown hydro- 
dynamic, numerical simulations. The analyses appear to 
offer a wide range of theoretical values for the mean metal- 
licity of the universe in neutral gas, yet the majority of 
variation stems from two key factors: (1) the adopted (or 
predicted) star formation history; and (2) the treatment of 
selection biases due to dust obscuration. An appreciation 
of these two aspects can be obtained by following the anal- 
ysis of Somerville, Primack, & Faber (2001) who focused 
on the stellar properties of the Lyman Break Galaxies. In 
addition to discussing their predictions for the star forma- 
tion history of the early universe, the authors examined 
the mean metallicity in cold gas. In all of their scenarios, 
the values exceeded the damped Lya observations by at 
least a factor of 3 at all redshifts z > 2. This discrepancy is 
a restatement of the "missing metals problem" described 
above. At present, the star formation history implied by 
various studies of high-z galaxies (including DLA them- 
selves) produce too many metals in comparison with the 
DLA. This is a universal characteristic of these chemical 
evolution models, even in those scenarios which underpre- 
dict the star formation history at z > 2 (e.g. Pei, Fall, & 
Hauser 1999; Mathlin et al. 2001). 

Several of the theoretical treatments, however, report 
successes in matching previous DLA metallicity samples. 
In all of these cases, the authors introduced significant 
selection biases owing to the effects of dust obscuration. 
Undoubtedly, dust obscuration plays a role in observa- 
tions of the damped Lya systems, especially at lower red- 
shift where the gas metallicity and dust content are pre- 
sumably highest. At z > 2, however, current samples 
of DLA toward radio-selected quasars exhibit no signifi- 
cant difference from optically-selected samples (Ellison et 
al. 2001b). Furthermore, PW02 found no dependence 
between the inferred dust opacity of observed DLA sight - 
lincs and the quasar magnitude, contrary to expectation in 
scenarios where dust obscuration is important. The orig- 
inal claims of reddening by Pei, Fall, & Bechtold (1991) 
have not been confirmed by larger, homogeneous quasar 
samples, e.g., Outram et al. (2001) found only a 2a in- 
dication of reddening in their z < 1 2dF quasar sample. 
At present, we believe there is no compelling evidence for 
dust obscuration 10 beyond its convenience as a possible 
solution to the missing metals problem. Regarding this 
aspect, we emphasize that no group has self-consistently 

10 We distinguish obscuration from depletion, since in DLA there 
is strong evidence from relative abundance ratios (e.g. Pettini et al. 
1994; PW02) for the depletion of refractory elements onto grains. 



matched the observed star formation history of the early 
universe with the metallicity measurements observed for 
the damped systems even when allowing for dust obscu- 
ration. Current theories of star formation and metal pro- 
duction in high redshift galaxies are missing a vital aspect 
of the processes. 

Looking toward the future, observations of 300 addi- 
tional DLA arc needed to bring the statistical error at 
z = 2 to 4 down to the level associated with systematic ef- 
fects such as differential depletion. With the introduction 
of ESI and similar future instrumentation, we expect this 
will be accomplished within the next decade. Significant 
gains, meanwhile, can be made very quickly at z < 1.5 
and z > 4 owing to the small sample size. Unfortunately, 
measurements at z < 1.5 arc slowed by the requirement 
of ultraviolet observations. This is a particularly challeng- 
ing aspect of damped Lya research because this redshift 
range corresponds to over half the age of the current uni- 
verse. While the COS spectrograph on the Hubble Space 
Telescope will improve the situation, a comprehensive view 
of metal enrichment over the past 8 Gyr will require the 
launch of a 4m-class ultraviolet telescope (Prochaska et al. 
2002). 
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